Generalize FakeQuantizer beyond intx #2714

andrewor14 · 2025-08-07T21:13:44Z

Stack from ghstack (oldest at bottom):

-> Generalize FakeQuantizer beyond intx #2714

Summary: Similar to #2628, but for FakeQuantizer. It is cleaner to
isolate the logic of each quantizer in separate classes, e.g. intx vs
nvfp4 vs fp8.

Naming change:

FakeQuantizer -> IntxFakeQuantizer

BC-breaking notes: This is technically not BC-breaking yet, since we
are only deprecating the old APIs while keeping them around. It will
become BC-breaking when the old APIs are removed in the future,
according to #2630.

Before:

config = IntxFakeQuantizeConfig(torch.int8, "per_channel")
FakeQuantizer(config)

After:

config = IntxFakeQuantizeConfig(torch.int8, "per_channel")
IntxFakeQuantizer(config) # or
FakeQuantizerBase.from_config(config)
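
For readers unfamiliar with the pattern, the rename and the `from_config` entry point can be sketched roughly as below. This is a minimal, torch-free illustration and not the torchao implementation: the class bodies are toy stand-ins, the config takes strings instead of a torch dtype, and `Fp8FakeQuantizeConfig` / `Fp8FakeQuantizer` are hypothetical names added only to show dispatch across more than one quantizer type.

```python
import warnings
from dataclasses import dataclass


@dataclass
class IntxFakeQuantizeConfig:
    """Toy stand-in for the real config; only the fields needed here."""
    dtype: str        # e.g. "int8" (the real config takes a torch dtype)
    granularity: str  # e.g. "per_channel"


@dataclass
class Fp8FakeQuantizeConfig:
    """Hypothetical second config type, used only to show dispatch."""
    dtype: str


class FakeQuantizerBase:
    """Common parent; from_config picks the subclass from the config type."""

    def __init__(self, config):
        self.config = config

    @staticmethod
    def from_config(config):
        if isinstance(config, IntxFakeQuantizeConfig):
            return IntxFakeQuantizer(config)
        if isinstance(config, Fp8FakeQuantizeConfig):
            return Fp8FakeQuantizer(config)
        raise ValueError(f"Unknown config type: {type(config).__name__}")


class IntxFakeQuantizer(FakeQuantizerBase):
    """Integer-specific fake quantization logic would live here."""


class Fp8FakeQuantizer(FakeQuantizerBase):
    """Hypothetical: float8-specific logic would live here."""


class FakeQuantizer(IntxFakeQuantizer):
    """Old name, kept as a thin deprecated alias so existing code still runs."""

    def __init__(self, config):
        warnings.warn(
            "FakeQuantizer is deprecated, use IntxFakeQuantizer instead",
            DeprecationWarning,
            stacklevel=2,
        )
        super().__init__(config)


config = IntxFakeQuantizeConfig("int8", "per_channel")

# New style: the base class dispatches on the config type.
new_style = FakeQuantizerBase.from_config(config)

# Old style: still works, but emits a DeprecationWarning.
with warnings.catch_warnings(record=True) as caught:
    warnings.simplefilter("always")
    old_style = FakeQuantizer(config)
```

Under these assumptions, existing callers of the old name keep working and only see a warning, which is why the change is a deprecation rather than a BC break until the alias is removed.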

Test Plan:

python test/quantization/test_qat.py

pytorch-bot · 2025-08-07T21:13:47Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/2714

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 New Failure, 1 Cancelled Job

As of commit 33a8305 with merge base 246b142:

NEW FAILURE - The following job has failed:

Run TorchAO Experimental Tests / test-cpu-ops (macos-14) (gh)
Process completed with exit code 1.

CANCELLED JOB - The following job was cancelled. Please retry:

Run TorchAO Experimental Tests / test-cpu-ops (linux.arm64.2xlarge) (gh)
##[error]The operation was canceled.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

jerryzh168 · 2025-08-07T21:55:28Z

If this is not BC-breaking, we probably don't need a BC-breaking note; it seems like more of a deprecation note.

meta-cla bot added the CLA Signed label Aug 7, 2025

andrewor14 requested review from jerryzh168 and drisspg August 7, 2025 21:14

andrewor14 added the topic: improvement label Aug 7, 2025

jerryzh168 added and removed the topic: bc-breaking label Aug 7, 2025

jerryzh168 approved these changes Aug 7, 2025

andrewor14 changed the base branch from gh/andrewor14/17/base to main August 8, 2025 15:55

andrewor14 merged commit 6cfa477 into main Aug 8, 2025
39 of 43 checks passed
